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The N-terminal cellulose-binding domains CBD N1 and CBD N2 from 
Cellulomonas fimi ceUulase CenC each adopt a jelly-roll |3-sandwich struc- 
ture with a cleft into which amorphous cellulose and soluble cellooligo- 
saccharides bind. To determine the orientation of the sugar chain within 
these binding clefts, the association of TEMPO (2,2,6,6-tetramethylpiperi- 
dine-l-oxyM-vl) spin-labeled derivatives of cellotriose and cellotetraose 
with isolated CBD N1 and CBD^ was studied using heteronuciear 1 H- N 
NMR spectroscopy. Quantitative binding measurements indicate that the 
TEMPO moiety does not significantly perturb the affinity of the celloo- 
ligo-saccharide derivatives for the CBDs. The paramagnetic enhance- 
ments of the amide l H N longitudinal (Af^) and transverse (AR 2 ) 
relaxation rates were measured by comparing the effects of TEMPO-cello- 
tetraose in its nitroxide (oxidized) and hydroxylamine (reduced) forms 
on the two CBDs. The bound spin-label affects most significantly the 
relaxation rates of amides located at both ends of the sugar-binding cleft 
of each CBD. Similar results are observed with TEMPO-cellotriose bound 
to CBD N1 . This demonstrates that the TEMPO-labeled cellooligosacchar- 
ides, and by inference strands of amorphous cellulose, can associate with 
CBD N1 and CBD N1 in either orientation across their p-sheet binding clefts. 
The ratio of the association constants for binding in each of these two 
orientations is estimated to be within a factor of five to tenfold. This find- 
ing is consistent with the approximate symmetry of the hydrogen-bond- 
ing groups on both the cellooligosaccharides and the residues forming 
the binding clefts of the CenC CBDs. 
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i * m Minn inal cellulose-binding domains, CBD Hl and 

introduction CBD N ,(Tomme et ai, 1995). In contrast to most 

The B-l 4-elucanase CenC from Cellulomonas fimi CBDs "which bind crystalline ^cellulose, the homolo- 
is mJulZ 1 Structure and function (Coutinho gous family IV CBD^ and CBD N2 from CenC are 
et al 1992 1991) Its central catalytic domain is unique in their specificity for amorphous cellulose 
connected by a short linker to two tandem N-term- and soluble cellooligosaccharides. As part of a col- 
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laborative effort to exploit these domains for use in 
biotechnology, we have used NMR spectroscopy 
and calorimetry to provide a structural and 
thermodynamic foundation for understanding the 
distinct binding properties of CBD N1 and CBD N2 
(Creagh et aL, 1998; Johnson et aL, 1996a,b, 1998; 
Tomme et aL, 1996a). 

CBD N1 (Johnson et aL, 1996b) and CBD^ 
are composed of two five-stranded antiparallel 
(E.B., P.EJ. & L.P.M., unpublished results) (3-sheets 
that fold into very similar jelly-roll sandwich struc- 
tures. As demonstrated by NMR chemical shift 
perturbations and the observation of intermolecu- 
lar protein-sugar NOEs, a single cello- 
oligosaccharide molecule binds to these CBDs 
within a groove or deft that runs across one 
(3-sheet face of each protein domain (Johnson et aL, 
1996a,b). The presence of a cleft readily explains 
the binding specificity of these CBDs. Soluble cel- 
looligosaccharides and single polysaccharide 
chains, as might be encountered in the amorphous 
regions of cellulose, bind within this cleft, whereas 
flat crystalline arrays of cellulose are excluded. 
In agreement with the observation that the affinity 
of CBD N1 for celiooligosaccharides increases in 
the order cellotriose < cellotetraose < ceilopentaose 
~ cellohexaose, the sugar binding sites in the CenC 
CBDs can each be spanned by approximately five 
glycosyl units (Johnson et aL, 1996a,b; Tomme et aL, 
1996a). 

In parallel with these structural studies, a 
detailed calorimetric analysis of the association of 
CBD N1 with polysaccharides was also conducted 
(Creach et aL, 1998; Tomme et aL, 1996a). CBD^i 
was found to bind regenerated (phosphoric acid 
swollen) cellulose and other soluble 0-1,4-linked 
polymers of glucose, such as hydroxyethyl cellu- 
lose and barley and oat-|3-glucan, with affinities 
equal to those measured for ceilopentaose and 
cellohexaose. Formation of the CBD N1 -sugar com- 
plexes is favored enthalpically, indicating that 
hydrogen bonding and van der Waals interactions 
provide the primary driving force for the binding 
event. Inspection of the structures of CBD N1 and 
CBD^ reveals that their binding clefts consist of a 
central strip of hydrophobic side-chains, flanked 
on both sides by polar residues. This led us to 
propose that the'pyranose rings of cellooligosacch- 
harides, and by inference single polysaccharide 



chains in regions of amorphous cellulose, are 
stacked: against the hydrophobic strip, while the 
flanking hydrophilic residues provide hydrogen 
bonds to the equatorial hydroxyl groups of the 
sugar (Johnson et aL, 1996a,b). Such interactions 
are often observed with carbohydrate-binding pro- 
teins (Quiocho, 1989, 1993; Vyas, 1991). 

The structures of CBD Nl and CBD^ were calcu- 
lated using NMR data collected for these protein 
domains in the presence of saturating concen- 
trations of cellotetraose or ceilopentaose, 
respectively Gohnson et aL, 1996b). Numerous 
intermoiecular nuclear Overhauser effects (NOEs) 
between the 13 C/ 15 N-labeled proteins and the 
unlabeled celiooligosaccharides were observed in 
isotope edited /filtered NOESY experiments 
(Otting & Wuthrich, 1990). However, the NMR 
spectra of the celiooligosaccharides are highly 
degenerate, thus preventing the unambiguous 
assignment of these NOEs to specific interaction* 
between sugar and protein protons. This precluded 
the direct determination of the structures ot the 
CBD-cellooligosaccharide complexes. 

As a first step towards achieving this goal, we 
have focussed on defining the orientation of a cel- 
lulose chain within the binding clefts of CBD N1 
and CBD N2 . Exploiting a novel glycosynthase tech- 
nology (Mackenzie et aL, 1998), we have prepared 
derivatives of cellotriose and cellotetraose with a 
Z^^-tetramethylpiperidine-l-oxy-^-yl (TEMPO) 
spin-label covalently attached to the reducing end 
of the sugar (Figure 1). The use of such paramag- 
netic relaxation probes has a long history in the 
study of biological macromolecules by NMR spec- 
troscopy (Kosen, 1989). The nitroxide moiety con- 
tains an unpaired electron that provides an 
efficient mechanism for the relaxation of neighsc- 
ing nuclei via dipolar coupling. Due to the magni- 
tude of the electron magnetic moment, this 
interaction extends to over 20 A. In contrast, pro- 
ton-proton NOEs are limited to separations of less 
than -5 A. Thus, these TEMPO-labeled sugars can 
be utilized to obtain long-range distance infor- 
mation about the CBD-cellooligosaccharide com- 
plexes. 3n practice, the excellent dispersion of the- 
signals 3rom the amide *H- 15 N groups, combined 
with the high degree of sensitivity of the HSQC 
experiment (Cavanagh et aL, 1996), allows the 
effect of the spin-label celiooligosaccharides on 
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Figure 1. Chemical structures 
of the spin-labeled glycosides: <a) 
TEMPO-Glc 3 and (b) TEMPO-Clc 4 . 
Reduction of the paramagnetic 
nitroxide (N-O) moiety produces 
the diamagnetic hydroxy la mine 
(N-OH) derivatives of these 
glycosides. 
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each non-proline residue in 15 N-labeled CenC 
CBDs to be evaluated. Similar studies using spin- 
labeled ligands and heteronuciear NMR methods 
have been reported by several groups (Gellespie & 
Shortle, 1997a,b; Zhao et al, 1997; Kleerekoper et al, 
1995; Yu et al, 1994). Using this approach, we 
demonstrate that the modified sugars bind to 
CBD N1 and CBD^ in both possible orientations 
across their p-sheet binding clefts. This result is 
consistent with the approximate symmetry of the 
cellooligosaccharides and the positions of the side- 
chains forming the binding clehs of the two protein 
domains, as well as 15 N relaxation measurements 
which indicate that residues within the binding 
cleft of CBD N1 remain conformationally flexible in 
the presence of these sugars (P.EJ., E.B. & L.P.M, 
unpublished results). 

Results 

Preparation of TEMPO- 
labeled cellooligosaccharides 

Syntheses of the nitroxide spin-labeled ceilotrio- 
side (TEMPOGlc 3 ) and ceilotetraoside (TEMPO- 
Glc 4 ) were accomplished in a two-step chemo 
enzymatic process, thereby avoiding many of the 
difficulties inherent in the preparation of oligosac- 
charide derivatives bv chemical methods alone. 
A TEMPO-labeled glucoside was first synthesized 
via standard Koenigs-Knorr methodology, essen- 
tially according to a published procedure 
(Gnewuch & Sosnovsky, 1986; Plessas & Goldstein, 
1981). This product was characterized by mass 
and, to a limited extent, H-NMR spectroscopy. 
Full NMR spectroscopic characterization was per- 
formed on the catalytically hydrogenated (reduced) 
and acetylated hydroxylarnine derivative that no 
longer bears a free radical. Conversion of the 
TEMPO-glucoside to TEMPO-Glc 3 and Glc 4 was 
accomplished by successive transfer of glucose 
residues from a-glucosyl fluoride using a mutant 
Agrobacterium sp. p-glucosidase (glycosvnthase) in 
which the catalytic nucleophiie, GIu358, was 
replaced by an alanine residue (Mackenzie et al, 
1998). This approach exploits the well-known 
transglycosylation reaction catalyzed by retaining 
glycosidases, yet has the advantage that the 
mutant enzyme cannot hydrolyze the end pro- 
ducts, thereby resulting in good yields of —70% 
per coupling. The products were purified by HPLC 
and their identities confirmed by ionspray mass 
spectrometry, and after reduction and acerylation, 
by ^-NMR spectroscopy. 

Binding of TEMPO-labeled 
cellooligosaccharides to CBD N1 and CBD^ 

We have demonstrated previously., using NMR 
spectroscopy and isothermal titration eaJorimefrv, 
that the monomelic CBD Nj binds soluble cellooli- 
gosaccharides with a stoichiometry of 1:1 (Johnson 
et al, 1996a; Tomme et al, 1996a). Similar results 




Total Ligand Concentration (mM) 



Figure 2. The association constants of the CenC CBDs 
for TEMPO-labeled cellooligosaccharides were measured 
using l H- 15 N NMR spectroscopy. Shown are the nor- 
malized chemical shifts changes, or fraction bound, of 
the amide proton of Asn81 in CHD^ upon titration 
with TEMPOGlc 3 (filled diamonds) and TEMPO-Glc 4 
(filled squaresj, and the nitrogen atom of Vail 97 in 
CBD N2 upon addition of TEMPOGlc 4 (open squares). 
The solid and broken lines represent the best fits of the 
data to the Langmuir isotherm describing the binding of 
one ligand to a single protein site on CBD NI or CBD M2 , 
respectively. The total chemical shift changes for Asn81 
and Vall97 were extrapolated to be 0.1 ppm (*H N ) and 
0.76 ppm ( 15 N), respectively. 



have been obtained with CBD N2 (data not shown). 
Continuing this approach, the interactions ot the 
TEMPO-labeled cellooligosaccharides with CBD hli 
and CBD,^ were analyzed quantitatively by moni- 
toring the *H N and 15 N chemical shifts of the pro- 
teins upon titration with these soluble sugars. The 
spectral changes resulting from the addition of 
TEMPO-GIC3 and TEMPO-Glc 4 are similar to those 
observed with the corresponding unlabeled celloo- 
ligosaccharides, indicating that these ligands all 
bind in the regime of fast exchange on the chemical 
shift timescale to the same regions in CBD N1 and 
CBD N2 and with the same 1:1 stoichiometry fdata* 
not shown). Apparent or macroscopic equilibrium 
association constants were obtained by fitting the 
titration data to the Langmuir isotherm describing 
the binding of one ligand molecule to a single pro- 
tein sit^. (Figure 2). 

TEMK>Glc 3 binds to CBD N1 with an apparent 
K a of 690(±260) M" 1 . This is marginally higher 
than the value of 180(±60) M" 1 measured pre- 
viously for unlabeled cellotriose (Johnson et aL, 
1996a). The apparent K a values of TEMPO-Glc 4 
for CBD N1 and CBD N2 are 4200(±400) M~ l and 
3600(±2200) M" 1 , respectively, while the associ- 
ation constants of CBD N1 and CBD N -> for unlabeled 
cellotetraose are 4200(^720) M ! (Johnson et iL, 
1996a j and 7(«0<±650) M _i (data not shown). The 
relatively lar^e errors in the K a values measured 
for the TEMPO-labeled cellooligosaccharides reflect 
the fact, that the nitroxide efficiently broadens the 
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signals from many amide groups within the bind- 
ing cleft to the point that they are no longer obser- 
vable. In general, it is these amide groups that 
show the largest chemical shift changes upon 
addition of sugar to the protein and would other- 
wise be best suited for fitting to the binding iso- 
therm equation. Within the error of the data, we 
conclude that the TEMPO moiety does not signifi- 
cantly perturb the affinity of the CenC CBDs for 
these cellooligosaccharides. By way of comparison, 
increases in affinity of ~10 to 20-fold result from 
the lengthening of cellotriose or cellotetraose by 
one additional glycosyl unit to form cellotetraose 



or cellopentaose, respectively (Tomme et at., 
1996a). 

TEMPO-labeled cellooligosaccharides bind 
CBD N1 £nd CBDre in multipl orientations 

In addition to perturbing the chemical shifts of 
amide groups in CBD N1 and CBD^, the binding of 
TEMPO-Glc 3 and Glc 4 causes a decrease in the 
intensities of the signals from many of these 
groups due to paramagnetic relaxation enhance- 
ment. "This is readily seen by comparing the l H- 15 N 
HSQC spectra of two proteins complexed with the 
oxidized and reduced cellotetraose derivative 
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Figure 3. Portions of the non-sensitivity enhanced J H- 15 N HSQC.spectra of (a), (b) CBD N1 and (c), (d) CBD N2 
bound to (a), (c) oxidized or (b), (d) reduced TEMPO-GIc 4 . The final ligand-to-protein ratios of 2.2:1 and 1.4:1 corre- 
spond to ~80% and 65% saturation of CBD N1 and CBDn^, respectively. Peaks from amides that "disappear" because 
of paramagnetic relaxation due to the nearby unpaired electron in the nitroxide-labeled cellotetraose are identified. 
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(Figure 3). Qualitatively, it is clear that a number 
of resonances present in the spectra of proteins 
bound to the reduced hydroxylamine form of the 
sugar are "absent" in the spectra recorded with the 
oxidized nitroxide form, i.e. their intensities fall 
below a very low contour level. Thus, these amide 
groups must be close (^ 11 A; vide infra) to the free 
electron in the nitroxide moiety. 

The amide protons whose resonances are absent 
in the l H- l5 N HSQC spectra of the complexes of 
TEMPO-Glc 4 with CBD N1 and CBD N2 can be classi- 
fied into two groups that are located on the oppo- 
site ends of the binding clefts of these proteins. (As 
shown in Figure 5, the binding cleft of each CBD 
lies across an antiparallel 0-sheet A formed by five 
strands, arranged as A1-A2-A5-A3-A4.) In the case 
of CBD N1 , the first group includes Vall7, Alal8, 
Tyrl9 and Gly20 from p-strand Al, while the 
second group comprises Tyr85, Gly86 and Thr87 
from (3-strand A4 and the loop between A3 and 
A4. Similarly, for CBD,^, the first group is formed 
by Serl66, Leul67, Tyrl68, Glyl69 from p-strand 
Al, while the second group includes of Tyr234, 
Arg235 and Thr236 from p-strand A4 and the pre- 
ceding loop, in contrast, peaks from amide protons 
in the central p-strand A5 of each CBD are present 
in the *H- 15 N HSQC spectra. Inspection of the ter- 
tiary structures of CBD N1 and CBD^ reveals that 
the outermost strands, Al and A4, are separated 
by ^-20 A. The observation that the amide protons, 
which are most strongly perturbed by the nitroxide 
spin label, are located in both of these outer 
p-srrands cannot be accounted for by the binding 
of TEMPO-Glc 4 to the CBDs in a single orientation. 
The simplest explanation of this result is that 
TEMPO-Glc 4 associates with CBD N1 and CBD N2 in 
at least two distinct orientations, such that the 
nitroxide moiety lies at either edge of their sugar 
binding clefts. 

The peaks from amide protons in (3-strands Al 
and A4 that are absent in the spectrum of CBD N1 
bound to TEMPOGlc 4 also disappear upon bind- 
ing TEMPO-Glc 3 (see Supplementary Material). In 
addition, signals from several amide protons, 
including those of Thr21 in strand Al and Ala41, 
Gln42 £ , Tyr43, Vai48, Asn50, and Gly51 in strand 
A2, are weak in the spectrum of CBD N1 complexed 
with TEMPO-Glc 4 and absent in the presence of 
TEMPO-Glc 3 . This qualitative distinction may 
reflect a slight difference in the positions of the 
two sugars when bound to CBD N1 . Several pro- 
tons, including the indole H El of Trpl6 and the 
amide H N of Leul46, which are not in the binding 
cleft of CBD N1 , are also affected by TEMPO-Gk> 
This is attributed to non-specific effects arising 
from the addition of a 22-fold molar excess of the 
nitroxide-labeled sugar to the protein sample. 

Measurement of Af?, and AR 2 
relaxation enhancements 

A qualitative inspection of the l H- 15 N HSQC 
spectra of the two CBD-TEMPO-Glc 4 complexes 



provides immediate insights into the binding 
orientations of the sugars. Further details regarding 
the positions of the bound nitroxide moiety can be 
gained bv a quantitative analysis of the effects of 
the spin label on the relaxation properties of the 
two CBDs. To reduce the possibility of non-specific 
interactions, CBD N1 and CBD,^ were studied at 
saturation levels of approximately 80% and 65%, 
respectively (Figure 2). 

The enhancements of the longitudinal (AR X ) and 
transverse (AKJ relaxation rates of CBD N1 and 
CBD N2 due to complexation with TEMPO-Glc 4 are 
presented in Figures 4 and 5. Data for CBD N1 
bound to TEMPO-Glc 3 are provided as Supplemen- 
tary Material. The AR 1 values were determined 
from the differences in the T x lifetimes, measured 
using a l H- l5 N HSQC sequence as a read-out 
of a non-selective inversion-recovery sequence 
(Cavanagh et al, 1996), for the proteins in the pre- 
sence of the nitroxide and hydroxylamine deriva- 
tives of cellotetraose. The AR 2 values were 
obtained using two complementary methods. In 
the first, the effect of the spin label on the proton 
transverse relaxation (AR 2fV oi) oi the l H N during 
the INEPT and reverse-lNEPT period of the HSQC 
pulse sequence was measured by comparing the 
total volume of its cross-peak in spectra recorded 
with the oxidized and reduced sugar. In the 
second^ aK 2 ,lw was determined from the change 
in the proton line-width of the amide measured 
with the two forms of the modified cellotetraose. 
The methods for detenruning AR Y and AR 2 assume 
that contributions to the peak volumes and line- 
widths due to scalar couplings and relaxation pro- 
cesses other than that mediated by the free radical 
remain constant, and that the oxidized and 
reduced glycosides interact identically with the 
CBDs. The lack of any significant amide ll H N or 
I5 N chemical shift perturbation upon reduction of 
the TEMPO group provides strong support for 
these assumptions (Figure 3). 

Following these approaches, 101 AR lf tA 
ARivai' and 99 a ^2,lw values were obtained for 
CBD N1 and 113 AR V 44 AR 2yol , and 118 A&m.w- 
values for CBD N2 (Figure 4). Rate enhancements 
were not measurable for the remaining non-pro- 
line residues for two reasons, besides spectral 
overlap. First, in the case of those amide protons 
that are closest to the nitroxide moiety, severe 
line-broadening rendered their signals undetect- 
able in the spectra recorded with oxidized 
TEMPO-Glc 4 . For these residues, we estimate 
AK 1 >2 i 5s" 1 and AR 2 >80s ! . Second, in the 
case uf those amide protons furthest trom the 
nitroxide group, changes in their relaxation rates 
were top smalJ to measure reliably. In particular, 
dilution effects due to the addition of ascorbic 
acid and subsequent pH adjustments caused the 
volumes of the peaks in the X H- 15 N HSQC spec- 
tra of the proteins recorded with the reduced 
sugar to decrease slightly. This led to small 
apparent negative AR 2 .voi values for many 
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Figure 5. Paramagnetic relaxation studies demonstrate that TEMPO-Glc 4 binds to each CenC CBD in at least two 
orientations such that the nitroxide moiety can lie at either end of the binding cleft. Shown are ribbon diagrams of 
CBDn, (left) and CBD N2 (right) positioned for the reader to look across (top) and down onto (bottom) the cellooligo- 
saccharide binding groove that is formed by p-sheet A in each protein. The individual strands of this (5-sheet are 
labeled. The ribbons are ramp-colored based upon A£ 2 .lw values ranging from 1 Hz (light yellow) to 80 Hz (dark 
orange). Residues whose l tir signals disappear completely in the presence of TEMPO-Glc 4 are indicated in red 
(AR 2 .lw > 80 Hz). The proline residues and other residues for which, no results were obtained are shown in white. 
The green and blue spheres indicate representative positions of the Jutroxide groups calculated using an ensemble- 
averaging protocol in X-PLOR with measured r cff as distance restraints. Note that the TEMPO-Glc 4 can occupy oniv 
one of these two orientations at any given time. 



amides whose relaxation rates were not perturbed 
significantly by the spin label. 

As seen in Figure 4, ARi and AR 2 show similar 
trends, yet the latter provide a more sensitive indi- 
cator of the effects of the spin-labeled sugars on 
CBD N1 and CBD N2 . In the cases of amide protons 
with resolved J H- 15 N HSQC cross-peaks that 
experience moderate paramagnetic relaxation 
enhancements, the values of AR 2 .voi an d AR 2LW 
agree well (ratio = 1.7(±1)). However, a more com- 
plete data set could be obtained by measuring 
changes in line-width rather than volume due to 
the above mentioned problems associated with 
dilution effects and difficulties encountered with 
partially overlapping peaks. 

Based on these measurements, it is immediately 
apparent that amide groups in four regions of 
CBD N1 and four regions of CBD N2 experience 
significant paramagnetic relaxation enhancements 
due to the bound TEMPO-Glc 4 . These include 



Glyl5 to Thr21, Tyr43 to AsnSl, Gln80 to Asp90 
and Glul22 to Serl33 in CBD N1 and G1v16(K-*q 
Glul72, Trpl92 to Asnl99, Glu229 to Glu239 and 
Gly266 to Ala279 in CBD N2 . These residues lie 
within the five (5-strands, Al to A5, and adjacent 
loops that torm the cellulose-binding cletts of 
each protein. Of these four regions, amide pro- 
teins in strands Al and A4 that comprise the 
outer edges of the binding face have markedly 
larger AR X and AR 2 values than do those in the 
central strand A5. this verifies that TEMPO-Glc 4 
binds CBD N1 and CBD N2 in multiple orientations 
such that the nitroxide spin label is located at 
either end ot the binding cleft. In contrast, the 
relaxation rates of amide groups located on the 
opposite side of each protein relative to this cleft 
(P-strands Bl to B5) do not exhibit marked 
enhancements. Similar patterns are observed with 
TEMPO-Glc 3 bound to CBD N1 (Supplementary 
Material). 



Calculati n of electron-prot n distances 

Values of x c , the correlation time for the fluctu- 
ation of the electronic-nuclear dipole-dipole inter- 
action, were calculated for individual residues in 
both CBD N1 and CBD^ bound to TEMPOGlc 4 
using the measured AR a and AR 2 .voi ° r A^xw 
rate enhancements. In the case of CBD N1 , t c values 
for 81 amide protons ranged from 0.1 ns to 3.6 ns, 
with a mean of 1.7(±0.5) ns. For CBD^, these ran- 
ged from 0.1 ns to 6.5 ns for 101 amide protons, 
with a mean of 1.9(±0.8) ns (Figure 6). The corre- 
lation time x c is dependent upon both the relax- 
ation of the electron and motions of the electron- 
proton vector (l/x c = l/x s + 1/t r ). In this equation, 
t s is the longitudinal relaxation time of the free 
electron and x R is the effective rotational corre- 
lation time of the electron-proton vector. The latter 
is dependent on the motional characteristics of the 
protein-ligand complex. Since t R is in the range ot 
10 * to 10 9 s for most proteins studied by solution 
NMR methods, and x s is typically longer than 10 ~ 7 
s for nitroxide radicals, x c is essentially equal to x R 
(Kosen, 1989). The correlation times for the global 
tumbling of CBD N1 and CBD,^ saturated with cel- 
lopentaose are approximately 7.4 ns, as determined 
by 15 N relaxation methods (P.E.J., E.B. & L.P.M., 
unpublished results). The shorter correlation times 
extracted from the paramagnetic relaxation 
enhancements may reflect conformational mobility 
of the bound TEMPO moiety. The large degree of 
variability in the x c values measured for the amide 
protons in CBD N1 and CBD^ is attributed to com- 



pounding errors from the AR X and AR 2 measure- 
ments, as well as different orientation-dependent 
effects of internal mobility on the distance between 
the free electron and the l H N of individual amide 
protons. 

Using the x c values measured for each residue, 
the effective electron-proton distances, r eff , between 
the TEMPOGlc 4 nitroxide and the individual 
amide protons in CBD N1 and CBD^ were calcu- 
lated. A similar analysis of the data for the 
TEMPOGlc 3 was not performed due to the occur- 
rence of non-specific relaxation enhancement. The 
most complete set of relaxation data was obtained 
from linewidth measurements, and thus the dis- 
tances reported in Figure 6 were deterrnined using 
the AR 2 ,lw values; however, similar results were 
derived from the AR 2 ,voi ra *e enhancements. Based 
on the data in Figure 6, we find that the measured 
values of r ett range from 11 to 23 A. Consistent 
with the prev ious discussion of the data presented 
in Figures 2 and 3, the amide protons in CBD N - 
and CBD^ closest to the nitroxide group are 
located-in p-strands Al and A4 on the binding 
faces of- the proteins, whereas those that are furth- 
est form strands Bl to B5. 



Structure calculations 

The positions of the nitroxide moiety in TEMPO 
Glc 4 bound to CBD N1 and CBD^ were calculated 
using the measured r eff as distance restraints for a 
simple minimization routine in X-PLOR (Brunger, 
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Figure 6. The correlation times, x c , and effective electron-proton distances, r ctf , calculated for (a), (b) CBD N1 and (c), 
(d) CBDfjj bound to TEMPO-Glc 4 . Residues whose signals disappear completely due to severe linebroadening are 
shown with a shaded bar at an upper bounds of r cif =. 11 A. The locations of the p-strands forming sheets A and B in 
the two proteins are indicated by the open and filled arrows, respectively. 



1992). To account for multiple binding orientations, 
we used an ensemble-averaging protocol in which 
r = [1/n Er^]~ l/6 where r is the electron-proton 
distance for each of n conformations, assumed to 
be equally populated (Bonvin & Briinger, 19%). 
A total or 60 rninimization runs were performed 
using CBD N1 or CBD N2 in a system with n = 1, 2, 
or 3 nitroxide groups, modeled as "atoms" of mass 
30 amu and a van der Waals radii of 2.5 A. When 
only one nitroxide group is utilized, the "atom" is 
always located in the middle of the binding deft, 
approximately above 0-strand A5. However, this 
solution poorly satisfies the experimental restraints 
requiring the spin label to be closer to amide pro- 
tons on the edges of the binding cleft than to those 
near its center. When an ensemble of two nitroxide 
atoms is considered, we find that in 31/60 and 60/ 
60 cases for CBD Nl and CBD N2/ respectively, one 
atom is localized to each edge of the binding cleft 
(Figure 5). In the remaining 29/60 cases for CBD N1 , 
one atom is found near the middle of the binding 
cleft and the other distant from the protein. The 
lowest energies clearly occur in the former situ- 
ation, with the nitroxide group near [3-strands Al 
and A4. The difference between the results 
obtained for the two proteins is attributed to the 
exact choice of the bounds used for the distance 
restraints. When an ensemble of three nitroxide 
atoms is generated, 24/60 and 42/60 cases for 
CBD N1 and CBD N2 , respectively, have one atom at 
each edge of the binding cleft and the third distant 
from the protein. In the remaining 36/60 and 18/ 
60 cases, the nitroxide groups occupy a variety of 
positions, close to and distant from the proteins. 

Based on these calculations, we conclude that 
the simplest explanation for the observed patterns 
of paramagnetic relaxation enhancement is that 
TEMPO-Glc 4 binds both CBD N1 and CBD N2 in two 
possible orientations such that the nitroxide moiety 
lies at either end of the binding cleft (Figure 5). In 
support of these results, the separation of -23 A 
between the positions calculated for the nitroxide 
atoms matches approximately the length of a 
TEMPCM31c 4 molecule. 

Discussion 

Binding affinity for spin- 
labeled cellooligosaccharides 

CBD N1 and CBD N2 bind TEMPO-Glc 3 and 
TEMPO-Glc 4 with affinities that are approximately 
equal to those measured previously for the unmo- 
dified sugars. Although the six-membered ring of 
TEMPO resembles in some ways that of a gluco- 
pyranosyl unit, it differs due to the presence of 
methvl substituents and the absence of equatorial 
hvdroxyl groups. As discussed by Tomme et al 
(1996a)^ calorimetric studies have revealed that the 
binding of soluble cellooligosaccharides to CBD N1 
is driven by enthalpically favorable interactions 
such as hydrogen bonding and van der Waals con- 
tacts. We therefore conclude that the TEMPO 
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group does not interact significantly with CBD N1 
or CBDmo and is positioned in or near the binding 
cleft solelv bv virtue of its covalent bonding to the 
cellooligosaccharides. This statement is supported 
by the observation that the chemical shift changes 
accompanying the binding of the modified and 
unmodified cellooligosaccharides to the two pro- 
teins are similar. We also conclude that the pre- 
sence of an anomeric hydroxy 1 group does not 
contribute to the binding of cellooligosaccharides 
by CBD Ni or CBD^. This is not unexpected given 
that cellulose, the natural ligand for these proteins, 
exists in a highly polymeric form. 

Multiple binding orientations 

Qualitative and quantitative analyses of the 
effects ot TEMPO-Glc 3 and TEMPO-Glc 4 on the 
relaxation properties of CBD N1 and CBD N2 clearly 
demonstrate that the labeled cellooligosaccharides 
are bound in at least two orientations. These orien- 
tations position the nitroxide moiety at either edge 
of the binding cleft of each CBD, such that amide 
protons in p-strands Al and A4, as well as in the 
loop between A3 and A4, are perturbed by the 
spin label to a greater extent that those in the cen- 
tral strand A5 (Figure 5). Since the CBDs form 1:1 
complexes with the cellooligosaccharides, these 
orientations are mutually exclusive such that in 
any individual CBD complex, the sugar is bound 
in one- direction or the other. Given that the 
TEMPO-labeled sugars and the corresponding 
unmodified cellooligosaccharides bind to the CenC 
CBDs with similar affinities and produce similar 
chemic^ shift perturbations, we conclude GW 
Glc 4 , and by inference, amorphous cellulose and 
other soluble p-glucans, are also bound in multiple 
orientations. 

Binding of the unmodified and TEMPO-labeled 
cellooligosaccharides occurs on an NMR time-scale 
of fast exchange as evident by the progressive 
change in the chemical shirts of CBD N1 and CBD N2 
upon the addition of these sugars. Therefore, the 
chemical shift of an amide proton at any point in 
the titration reflects a population-weighted average 
of its chemical shifts in the free and all-bound 
forms of the CBD. As a result, the equilibrium 
association constants determined by NMR and 
calorimetric methods represent apparent or macro- 
scopic K a values that are the summation of the 
microscopic binding constants for each possible 
orientation of the sugar-protein complex (Wyman 

& Gill, 1990). 

An important question to ask is whether a pre- 
ference for one binding orientation exists. During 
the titrations of CBD N1 and CBD N2 with the nitrox- 
ide-labeled sugars, it was noted that the signals 
from residues on p-strand Al disappeared at lower 
levels of saturation than did those on A4 (Johnson, 
1998). This could indicate that the binding ot 
TEMPO-Glc 4 in an orientation with the nitroxide 
moietv-near p-strand Al is slightly favored, or 
simply .that the nitroxide is positioned closer to the 



i avtrKS'iststiuutiyusttuututnut* ouiuttiy uy ot*/iw udl/s 



amides in this strand relative to those in A4. If we 
assume that the TEMPOlabeled sugars bind in fast 
exchange between two possible orientations, then 
the measured r e(f is given by: 

Here r t and r 2 are the distances from an amide pro- 
ton to the two positions of the free electron, and / bl 
and f b2 are the fractions of total protein bound by 
the ligand in each orientation. Using the positions 
of the nitroxide atoms shown in Figure 5, we calcu- 
late that varying/ bl from 0.1 to 0.9 does not change 
the predicted r eH for amide protons near the edges 
of the CBD N1 and CBD^ binding clefts by more 
±20%. In other words, within experimental error, 
the effects of any small population differences 
between the spin-labeled ligands bound in one or 
the other orientation cannot be distinguished due 
to the sixth-power dependence of the relaxation 
enhancement on the proton-electron separation. 
This is readily seen by considering a case for 
which r x = 10 A and r 2 = 20 A. The calculated r eH 
is 10.2, 11.2, and 14.4 A for f hl of 0.9, 0.5, and 
0.1, respectively. A similar conclusion was made 
by Bonvin & Briinger (1995, 1996) when they 
demonstrated that NOE-derived distances are not 
sufficient to determine the fractional occupancies 
of the individual conformations of a protein that 
exhibits conformational heterogeneity. Because of 
this limitation, we conservatively estimate that the 
relative association constants for the binding of 
TEMPO-Glc 4 to CBD N1 or CBD N2 in the two poss- 
ible orientations are within a factor of approxi- 
mately five- to tenfold. Note also that if r x = r 2 and 
/bi=/b^ then saturation of the CBD at 65% 

(/bi = 0- 325 ) md 80 % (/bi = °- 4 ) leads to ^ r *« 
that is overestimated by only ^7% and 4%, 

respectively. 

Extrapolating from these studies, it is reasonable 
to suggest that TEMPOGlc 3 and TEMPOGlc 4 may 
be bound in multiple conformations that have one 
or the other of these two general orientations, yet 
also differing in the exact position (or register) of 
the sugar with the p-sheet clefts of CBD N1 and 
CBD N2 . Evidence for this hypothesis stems from 
two observations. First, the longer r eU values 
(>15 A) measured in this study, were consistently 
shorter than the electron-amide proton distances 
predicted from the calculated positions of the nitr- 
oxide atoms shown in Figure 5. Bearing in mind 
the significant errors associated with the measure- 
ment of these distances (see below), this could 
reflect a small population of ligand shifted to pos- 
ition the nitroxide near the center of the binding 
clefts of the CBDs and thus transiently close to 
amide protons in strand A5 and p-sheet B. Second, 
as noted previously (Johnson et al. f 1996a), the X H N 
and l5 N chemical shifts of CBD^i and CBD N2 are 
perturbed similarly upon binding of Glc 3 , Glc 4 , 
Glc 5 , and Glc*. This suggests that the four cellooli- 
gosaccharides interact structurally with the CBDs 



in the same manner, possibly by sliding within 
their binding clefts. 

Structural symmetry can xplafn the multiple - 
binding orientations — 

An explanation for the binding of cellooligosae- 
charides by CBDnj and CBD^ m multiple orien- 
tations is provided by inspection of the structures 
of both the ligands and the protein molecules. 
Hydrogen bond formation between the oligosac- 
charide and the protein provides an enthalpic driv- 
ing force for binding (Tomme et aL, 1996a), and 
thus the locations of polar atoms within the mol- 
ecules should be considered. As shown in Figure 7. 
the hydroxyl and hydroxymethyl groups of cellote- 
traose are approximately symmetrically disposed 
such that comparable hydrogen bonds could be 
formed-to the CBDs whether the reducing end of 
the ceUooligosaccharide is located closer to 
p-strand Al or A4. In addition, as illustrated in 
Figure 8, the polar, non-polar, and aromatic side- 
chains implicated in ceilooligosaccharide recog- 
nition (Johnson et aL, 1996a,b) are approximately 
symmetrically positioned about the centers of the 
binding clefts of CBD N1 and CBD^. This provides 




Figure ~. Cellotetraose has approximate symmetry 
such that its hydroxyl and hydroxymethyl groups 
would occupy similar positions when the sugar chain is 
bound in either orientation by CBDmj and CBD N2 - 
(a), (b) Cellotetraose (Gepier et al, 1994) with its redu- 
cing end positioned at the bottom or top of the Figure, 
respectively. The molecules are related by successive 
180 c rotations about axes perpendicular and parallel to 
the long axis of the ceilooligosaccharide. 
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Figure 8. The residues implicated in cellulose recognition by the CenC CBDs are positioned approximately symme- 
trically about the centers of their binding clefts. This mav facilitate the binding of a cellooligosaccharide chain in 
either" orientation. Shown are ribbon diagrams of CBD N1 (left) and CBD N , (right) positioned for the reader to look 
across (top) and down onto (bottom) the cellooligosaccharide binding cleft that is formed by 0-sheet A in each pro- 
tein. The side-chains of selected residues within the cellulose binding clefts are color coded as blue, polar; red ; hydro- 
phobic; and yellow, aromatic. 



further opportunity for the complex formation of 
the cellooligosaccharides in multiple orientations, 
while inamtaining similar patterns of hydrogen 
bonds to the equatorial hydroxyl groups and 
van der Waals contacts to the glucopyranose 
rings. 

It is unlikely that all the oxygen atoms in cellote- 
traose and the polar atoms in the binding clefts of 
CBD N1 or CBD^ are involved simultaneously in 
intermolecular hydrogen bonding. The association 
of cellotetraose with CBD N1 involves a AG r of 
-4.9 kcal mol" 1 at 35 °C (Tomme et al, 1996a). If 
the sole contribution to this binding free energy is 
assumed to be hydrogen bonding involving the 14 
sugar hydroxyl groups, each would account for 
only -0.35 kcal mol~ . Thermodynamic studies of 
several protein-carbohydrate complexes have 
demonstrated that the presence of some hydroxyl 
groups is essential for binding, whereas others are 
unnecessary or even a hindrance. The net contri- 
butions of individual hydroxyl groups to binding 
generally range from -0.5 kcal mol" 1 to -2 kcal 
mol " 1 , yet can be as high -6 kcal mol 1 (Lemieux, 
19%; Quiocho, 1993; Street et al, 1986). Based on 



these values, CBDnj likely forms between 2.5 and 
ten hydrogen bonds to cellotetraose. Given the 
large number of potential hydrogen-bonding sites 
on this sugar and on the side-chains lining the 
binding clefts of CBD N1 and CBD N2 , it is reason- 
able to^suggest that cellooligosaccharides, as w?W 
as amorphous cellulose, could bind these proteins 
in many possible conformations with similar free 
energies. These conformations include opposite 
orientations of the sugar chain across the p-sheet 
faces of the CBDs, as well as possible multiple reg- 
isters within their binding clefts. 

Further insights into the binding promiscuity of 
the CenC CBDs are also gained from dynamic 
studies of the proteins and protein-sugar com- 
plexes using N and 2 H relaxation measure- 
ments. As will be reported elsewhere, the 
backbone amide protons and methyl<ontaining 
side-chains of CBCVj! show motional disorder on 
a nano- to picosecond time-scale that is not 
restricted upon binding cellopentaose. This 
internal mobility or flexibility may also allow the 
cellooligosaccharides to bind in multiple, rapidly 
interconverting conformations. 



Use f spin labels f r obtaining ligand-protein 
distance restraints 

fvTifr^viHftJ a hglpr| rpllnoliffn^rrharides clearly 

pr ovide a powerful metho d for characterizing pro- 
tel rTcarBoliyd rata Interactions, ijualitatrvely, the 
effects of the tree radical on tne NMR spectra of 
CBD N1 and CBD N2 yielded immediate evidence for 
the binding of TEMPOGlc 3 and Glc 4 in multiple 
orientations. Although it was also possible to cal- 
culate the effective positions of the nitroxide moi- 
ety bound in a minimum of two possible 
orientations on the CBDs from a quantitative anal- 
ysis of the measured paramagnetic relaxation 
enhancement, the accuracy of these models is lim- 
ited for numerous reasons. These include: (i) com- 
plications associated with multiple conformations 
and partial saturation, as discussed above; (ii) 
the uncharacterized conformational mobility of the 
bound TEMPO group; (iii) inaccuracies in the 
NMR-derived structures of CBD N1 and CBD^ cal- 
culated without inclusion of the bound Glc 4 or 
Glc 5 , respectively (Johnson et aL. 1996b); (iv) errors, 
which tend to lead to an underestimation of longer 
values of r e(t/ arising from difficulties in measuring 
small values of Ai^ and AR 2 , and from the possi- 
bility of relaxation enhancement due to non- 
specific binding or to collisions between CBD- 
TEMPO-Glc 4 complexes in solution; and (v) the- 
lar kof distance restraints with < Vj A resulting 
fro m the efficiency o f paramagnetic relaxation. The 
latter problem could be add ressed by studyin g 
CBD^t and LBD M -> partially "T itrated with small 
qua ritttfer of TEMPO-G ku o ^w TEK m ixturg&j^the 
labeiecT suear in its oxidized and reduced torms i n 
orde rTo"identiry the amides closest to the bound 
nitroxide^ 

T5"understand in detail the mechanism of cellu- 
lose binding by the two Family IV CBDs, we are 
currently exploiting glycosynthases to prepare 
selectively I3 C-labeled cellooligosaccharides. these 
compounds will be utilized for isotope-edited and 
filtered experiments aimed at resolving and assign- 
ing intermolecular NOEs between the CBDs and 
the bound sugars. Using a combination of short- 
range distance restraints derived from interproton 
NOE interactions and long-range restraints from 
electron-proton paramagnetic relaxation enhance- 
ments, we hope to define more accurate structural 
models of the ensembles of the complexes of 
CBD N1 and CBD^ with cellooligosaccharides. 

Biological implications 

Studies of numerous cellulases and hemicellu- 
lases indicate that CBDs facilitate hydrolysis by 
passively increasing local enzyme concentrations 
on the surface of cellulosic and hemicellulosic sub- 
strates and, possibly, by actively disrupting non- 
covalent interactions between the chains of these 
polysaccharides (Tomme et aL, 1995). The speci- 
ficity of CBDs for the various aliomorphs of cellu- 
lose (e.g. crystalline or amorphous) will also serve 



to target these enzymes to distinct regions of this 
complex substrate. 

The results presented here demonstrate clearty 
that in isolation, CBD Nl and CBDnj each bind sol- 
uble cellooligosaccharides in multiple orientations. 
This immediately prompts the question as to the 
role played by these CBDs within their native con- 
text. CenC is a 1069 residue p-l,4-giucanase, conxh 
posed of CBD N1 and CBD^ arranged in tandem ax 
its N terminus, a central family 9 catalytic domain^ 
and two additional domains of unknown functioiv 
at its C /terminus (Coutinho et aL, 1992, 1991). The; 
enzyme cleaves a variety of cellulosic substrates 
and, based on an analysis of its hydrolysis of car- 
boxy methyl-cellulose, is semi-processive with both 
endo- and exoglucanase activities(Tomme et aL. 
1996b). Although the structures of the isolated 
family IV binding domains (CBD N1 and CBD N2 ) 
and the homologous Family 9 catalytic domains 
from Clostridium thermocellum celD (Juy et aL, 1992) 
and TJiemtonionospora fusca E4 (Sakon et aL, 1997) 
have been determined, the spatial arrangement of 
these modules within native C fimi CenC remains 
to be defined. 

CBD N1 and CBD^ are juxtaposed without an 
intervening linker. Given that the affinity of the* 
each domain for cellotetraose is equivalent whether 
joined or in isolation (unpublished), it is likely that 
CBD N1N2 can also bind oligomeric sugars in mul- 
tiple orientations. This theory is reasonable, as the 
binding clefts of the two homologous CBDs both 
span approximately five glycosyl units and thus 
each domain in CBD N1N2 can independently 
accommodate a single cellooligosaccharide mol- 
ecule. Keliminary studies indicate that the affinity 
of CBD^jt^ for phosphoric acid swollen cellulose 
is only approximately 2 fold greater than that of 
CBD N1 for this polymeric substrate (Tomme et aL. 
1996a). This implies that, when joined in tandem, 
these CBDs bind amorphous cellulose in an addi- 
tive, rather than co-operative, manner. A simple 
interpretation of this result is that the two domains 
comprising CBD N1N2 are structurally constrained, 
due to the lack of a flexible linker, such that they 
cannot bind simultaneously to adjacent regions ■ of •*- 
a single polymer chain. We therefore speculate that 
the tandem CBDs anchor CenC to its natural sub- 
strate by bridging chains of amorphous cellulose 
without a strong preference for their orientations. 
Note however, as discussed above, we can only 
estimate that the relative association constants for 
the binding of TEMPOGlc 4 to CBD N1 or CBD N2 in 
the two possible orientations are within a factor of 
approximately five- to tenfold. Thus in combi- 
nation. CBD N1N2 may in fact exhibit a significant 
overall preference for binding strands of amor- 
phous cellulose positioned to match the yet 
unknown orientation of the two domains within 
this tandem CBD. Resolution of this important 
question hinges upon the determination of the 
structure of CBD N1N2 as well as further character- 
ization of naturally occurring amorphous cellulose 
(Bayer et aL, 1998). 
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CBD N1N2 is connected to the catalytic domain of 
CenC by a short proiine-rich polypeptide linker 
sequence (Coutinho et al., 1991, 1992). In contrast 
to the isolated CBDs, it is certain that this catalytic 
domain has a definite orientational requirement for 
the binding and hydrolysis of cellulosic substrates. 
Although the exact boundaries and thus length of 
the linker sequence is unknown, the observation 
that it is highly sensitive to proteolytic cleavage 
(Tomme et al, 1996b) suggests that the linker 
affords at least some degree of flexibility between 
the binding and catalytic domains of CenC. This 
may allow the cleavage of properly oriented chains 
of amorphous cellulose other than those to which 
the CBDs are bound. Processivity would result 
from the diffusional sliding of both the binding 
and catalytic domains along the substrate cellulose 
chains. Enzymatic studies of CenC with and with- 
out CBD N1N2 , as well as structural characterization 
of the catalytic domain, are clearly necessary to 
understand the role of these Family IV CBDs in 
modulating the activity of this C.fimi p-l,4-gluca- 
nase. 

Although over 180 putative CBDs in 13 families 
have been identified, little is known about the 
detailed mechanism of cellulose and hemicellulose 
binding by these protein domains. Studies of sev- 
eral family 2 CBDs, such as those from C fimi Cex, 
have demonstrated that binding to crystalline cel- 
lulose is mediated by conserved aromatic residues 
aligned along a flat face of the molecule (Xu et al, 

1995) . Calorimetric measurements have proven 
that binding is entropically driven(Creagh et al, 

1996) , indicative of hydrophobic stacking of the 
aromatic side-chains on the glucose rings. Hydro- 
dynamic studies have also revealed that CBD Cex 
diffuses readily across the surface of crystalline cel- 
lulose Qervis et al, 1997). We therefore speculate 
that CBD Cex , as well as members of other CBD 
families, also binds in multiple orientations to cel- 
lulose. In contrast, based on the crystal structure of 
a fragment of the endo/exocellulase E4 containing 
a family 3c CBD and a family 9 catalytic domain, 
Sakon et al (1997) have hypothesized that the bind- 
ing domain aids in the activity of this enzyme by 
helping to feed a cellulose chain, bound in a 
defined orientation, into the active site of the cata- 
lytic domain. This distinctive behaviour may result 
from the intimate structural association of the two 
domains in the T.fusca endo/exocellulase. 

In summary, we have shown that CBD N1 and 
CBD N2 from C. fimi CenC bind TEMPOlabeled cel- 
looligosaccharides in multiple orientations. This 
promiscuous mode of sugar recognition can be 
attributed to the approximate symmetry of the 
hydrogen bonding groups on cellulose and on the 
side-chains lining the binding clefts of these two 
CBDs. Further structural studies of both native 
CenC and its substrate, amorphous cellulose, are 
necessary to understand the mechanisms by which 
this and other cellulolytic enzymes function in the 
efficient degradation of biomass. 



Materials and M thods 

Synthesis of TEMPO-labeled cello Hg saccharid s . 

Details of the synthesis of TEMPOGlc 3 and TEMPO- 
Glc 4 will be provided elsewhere. lonspray mass spec- 
troscopy of the final products yielded 658 (M+), 659 
(M + 1) and 660 (M + 2) for TEMPO-Glc3 (calculated for 
C^H^NO^: 6587) and 820 (M+), 821 (M + 1), and 822 
(M + 2) for TEMPO-Glc 4 (calculated for C^H^NO^: 
820.8). Useful *H-NMR data could not be obtained for 
these products due to the presence of the spin label. 
Structural characterization was therefore achieved by 
reduction of the nitroxide group to a hydroxy la mine and 
acetvlation of the entire molecule. The spectral data for 
these derivatives of TEMPO-Glc 3 and TEMPO-Glc 4 , 
respectively, are provided below. 

2,2,6,6-tetramethylpiperidin- 1-acetoxyt-4-yl 2,3,4,6-tetra- 
O -acetyl^ -D-gtucopyranosyH1-+ 4)-2, 3, 6-triO -acetyl- 
$-Chyluttpyranosyl-(1~+4)-2,3,6'tri-0-acetyl-$- 
o-glucopyranoside 

X H NMR (500 MHz, CDC1 3 ) 5: 5.14 (dd, 1 H, J 3A 
9.6 Hz, H-3), 5.13-5.09 (m, 2 H, H-3' H-3"), 5.05 (dd, 1 H, 
/ 4 . 5 . 9.5 Hz, H-4"), 4.89 (dd, 1 H, ] rx 8.8 Hz, ] TX 
8.1 Hz, H-2"), 4.85 (dd, 1 H, ^ 9.5 Hz, J 2 . r , 8.7 Hz, H- 
2'), 4.83 (dd, 1 H. / 2<3 9.6, Hz } 2 , 7.3 Hz, H-2), 4.60-4.47 
(m, 4 H, H-l H-l' H-l" H6a), 4.42 (dd, 1 H. J^^ 
11.7 Hz, ^ 5 1.5 Hz, H-6a') # 4.34 (dd, 1 H, / fa 
12.4 Hz, . x 4.4 Hz, H-6a"), 4.14 (dd, 1 H, / hb y 4.5 Ha, 
H-6b ). 4.07 (dd, 1 H, 5.9 Hz, H-6b), 4.04 (dd, 1 H, 
Jrt-s- 1.6' Hz, H-6b"), 3.93 (m, 1 H, H-l'"), 3.76 (dd, 1 S, 
/, 5 9.4-Hz, H-4'), 3.71 (dd, lH,/ 4 3 9.5 Hz, H-4), 3.65 
(m, 1 H, H-5"), 3.63-3.58 (m, 2 H, H-5 H-5'), 2.14-1.98 (33 
H, 11 x Ac), 1.83-1.38 (m, 4 H, H-2'" H-3'"), 1.13 (6 ft, 
CH 3 ), 1.10 (6 H, CH 3 ). MS (lonspray) 1123 (M + 1); cal- 
culated for C 4V H 71 N02h (1122.1). 

2,2,6,6'Tetramethylpiperidin- 1 -acetoxyl-4-yl 2,3,4,6-tetra- 
0-acetylj>-D-glucopyranosyH1-> 4)- 2,3,6-triO-acetyl- 
$'D'gluc$pyranosyl-(1^> 4)- 2,3,6-tri-0-acetyl-f>~ 
D-glucopyranosyl -(1^>4)- 2,3,6-tri-O -acetyl^- 
o-glucopyranoside 

l H NMR (400 MHz, CDC1 3 ) 6: 5.15 (dd, 1 H, h A 
9.5 Hz, H-3), 5.12-5.08 (m, 3 H, H-3' H-3" H-3" ), 5.04 
(dd, 1 H, J 4 ~ y- 9.5 Hz, H-4'"), 4.89 (dd, 1 H, ] T X - 8.9 Hz,. 
/ 2 ~ r . 8.0 Hz, H-2"'), 4.86- 4.81 (m, 3 H, H-2 H-2' Ht2"T, 
4.60-4.47 (m, 3 H, H-l H-V H-l" H-l"' H6a), 4.60-4.32 
(m, 3 H, H-Oa H-6a" H-6a'"), 4.15-1.00 (m, 4 H, H-6b H- 
6b' H-6b" H-6b ") 3.95 (m, 1 H, H-l""), 3.81-3.68 (m, 3 H, 
H-4 H-4" H-4""), 3.66 (m, 1 H, H-5"'), 3.62-3.57 (m, 3 A, 
H-5 H-5' H-5"), 2.15-1.96 (42 H, 14 x Ac), 1.85-1.37 (m, 4 
H, H-2"" H-3""), 1.15-1.10 (12 H, 4 x CH 3 ). lonspray MS 
1411 (M + 1); calculated for C ftl H 87 NO*, (1410.34). 

Protein samples 

The l5 N-labeled CBD N1 (residues 1-152 of CenC) and 
CBD N2 (residues 146-296) were produced by expression 
of the plasmids pTugNl and pTugN2 in Escherichia cdi 
JM101 cells grown in minimum media containing 1 g/1 
of l5 NH 4 Cl and 1 g/1 of 99% 15 N-labeled Isogro (Isotec 
Inc.; Johnson et al, 1996a). The secreted proteins were 
purified from the culture supernatant and the periplas- 
mic fraction using both affinity chroma tograpny on cel- 
lulose (Avicel) and ion-exchange chromatography, as 



described (Johnson et al, 1996a). Samples of CBDm! and 
CBDm2 were exchanged into a final buffer of 50 mM 
sodium chloride, 50 mM sodium 2 H 3 -acetate, 3 to 5 mM 
CaCl 2 , and 0.02% sodium azide in 10% 2 H 2 O/90% H 2 0 
at pH* 6.1 using a microsep concentration device (Fil- 
tron). CBD N1 binds calcium with an equilibrium associ- 
ation constant of -10 s M~ l at pH 6.0 and thus exists as a 
1:1 CBD N1 -Ca +2 complex under these conditions 
(Johnson et al., 1998). The metal ion stabilizes the folded 
structure of CBD N1 but does not alter its affinity for eel- 
^oligosaccharides. In contrast, CBD N2 does not bind cal- 
cium appreciably. 



completion of the data collection with either CBDmi or 
CBD N2 bound to the paramagnetic cellooiigosaccharide, 
the nitroxide functionality was reduced to the diamag- 
netic hydroxy lamine by the addition of two molar 
equivalents of solid L-ascorbic acid (Sigma). The pH* qt 
the sample was re-adjusted to 6.1 and the spectra necessr 
ary to measure &R Y and A.R 2 were recorded. All spectra 
were acquired as 1024 x 96 complex points with spectral 
widths of 6500 and 1450 Hz in the *H and 15 N dimen- 
sions, respectively. Selective water flip back pulses were 
utilized "to ensure minimum perturbation of the water 
magnetization (Zhang et al, 1994; Grzesiek & Bax, 1993). 



Titrations with TEMPO-labeled eel loo I igo sac char ides 

The binding of the TEMPO-labeled cellooligosacchar- 
ides to the CenC CBDs at pH* 6.1 and 35 °C was 
measured quantitatively using l H- 15 N HSQC spec- 
troscopy. Stock solutions of TEMPOGlc, and TEMPO 
Glc 4 were prepared by weight in the above buffer and 
added in small aliquots to the protein samples. The 
titration of CBD N1 with TEMPO-Glc 3 was carried out by 
progressively adding the labeled sugar to 0.32 mM pro- 
tein up to a final ligand-to-protein ratio of 22:1. The 
titrations of 0.65 mM CBD N] and 0.48 mM CBD N2 with 
TEMPO-Glc 4 were taken to final corrected ligand-to-pro- 
tein ratios of 2.2:1 and 1.4:1, respectively. Protein concen- 
trations were measured using £ 2 no = 21370 M" cm" 
and 20500 M _l cm" 1 for CBD N1 and CBD^, respectively 
(Johnson et al., 1996a). Equilibrium association binding 
constants were determined by non-linear least-squares 
fitting of the J H N and 15 N chemical shifts of numerous 
amide protons versus cellooiigosaccharide concentration 
to the Langmuir isotherm describing the binding of one 
ligand molecule to a single protein site(Johnson et al, 
1996a). These amide protons, which include Gly7, GlylS, 
Val34, Gly44, Thr65, Gln80, AsnSl, Thr87 and Glyl30 for 
CBD N1 and Vall97, Tyrl98, Gly231, Tyr234 and Aia237 
for CBD N2 , exhibit fast exchange between the free and 
bound states on the chemical shift time-scale and are 
detectable, albeit with diminishing intensity, over the 
course of the entire titration. In the cases of the TEMPO- 
Glc 4 titrations, good fits of the experimental data points 
to the binding isotherms were obtained only if scaling 
factors of ~1.3 (for CBD N2 ) or 1.8 (for CBD N1 ) were intro- 
duced to correct for apparent errors in measuring the 
weight of the sugar (~1 mg) during the preparation of 
stock titrant solutions. The reported binding constants 
and errors are the averages and standard deviations of 
the individual values measured for each *H N and 15 N 
nuclei. 



NMR spectroscopy 

Spectra were acquired on a Varian Unity 500 MHz 
spectrometer at 35 °C. Each point of the titration of the 
^N-labeled proteins with TEMPOGlc 3 or Glc 4 was mon- 
itored bv the acquisition of a sensitivity-enhanced gradi- 
ent l U- U N HSQC spectrum (Kay et al, 1992). Upon 
formation of the sugar-protein complexes, 1 H N T, values 
were measured using a sensitivity-enhanced gradient 
X H- 15 N HSQC sequence as a read-out of a non-selective 
inversion-recovery experiment recorded with delays of 
t = 0, 0.1. 0.2, 0.4, 0.8 and 2 seconds. The t = 0.1 spec- 
trum was repeated to help estimate experimental error. 
A non-sensitivity-enhanced gradient ] H- 15 N HSQC spec- 
trum was then recorded to obtain the spin-spin relax- 
ation enhancements, AR 2 , of the amide l W* nuclei. After 



Calculation of AR, and AR 2 

NMR spectra were analyzed using a combination of 
Felix (Biosym Technologies) and MMRPipe (Delaglio 
et al, 1995). Data for the J H Tj series were processed 
with mild Lorentzian-to-Gaussian apodization. The rela- 
tive peak volumes, V t , at each inversion-recovery delay t 
were fit .to the function V t = V 0 exp(-f/T J, as described 
by Farrow et al (1994). V„ is the volume at time t = 0, 
and errors in the measured 7, lifetimes were estimated 
using Monte Carlo simulations. The paramagnetic 
enhancement of each amide proton spin-lattice relaxation 
rate, A£ t , was calculated as: 



ARi 



111 



Ox and red indicate the data collected with the labeled 
cellooligosaccharides in the nitroxide and hydroxylamii|e 
forms, respectively. 

The non-sensitivity enhanced 1 H- 15 N HSQC spectra 
were processed without apodization in the proton 
dimension and with a mild exponential linebroadening 
window in the nitrogen dimension. Two methods were 
used to measure individual values of AR 2 . In the first, 
the ratio of the peak volumes recorded for an amide in 
the presence of the nitroxide (V ox ) versus the hydroxyla- 
mine-labeled ( V^) cellooiigosaccharide is given by: 



ox 



red 
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where f= 10. 1 ms is the total time during the INEPT 
and reverse-INEPT components of the HSQC pulse 
sequence in which J H N magnetization is in the transverse 
plane akd thus subject to paramagnetic relaxation 
A non-gradient enhanced HSQC was utilized to avoid . 
possible complications due to the use of two reverse- 
INEPT sequences. By measuring total peak volumes, and \ 
not heights, the effect of differential T 2 relaxation during 
the detection periods of the spectra recorded in the pre- 
sence of the oxidized and reduced sugars can be neg- 
lected. Peak volumes were determined by direct 
integration over a manually adjusted "footprint" and 
not by fitting of an idealized peak shape. Equation (2) 
can be rearranged to obtain AR 2 vo i : 

(*L-(iL-GMfc) f 

In the second method, AR 2rL w were obtained from the 
change in the linewidths of the 'H- 15 N cross-peaks in the 
proton dimension: 

AR 2 .LW - "(^) red = mLW ° X " LWred) 



0£0 



where LW ox and LVV^ are the full half-height line- 
widths in the presence of the nitroxide and hydroxy la- 
mine-labeled ceilooligosaccharides, respectively. The 
line-widths were measured by non-linear least-squares 
fitting of vectors, extracted in the proton dimension 
through a given cross-peak, to the equation for a single 
Lorentzian peak split by a 3 /hn-Hc coupling using the 
program PLOTDATA (TRIUMPH, UBC, Vancouver, 
Canada). The coupling constant was either treated as an 
independent variable or fixed at the value measured pre- 
viously (Johnson et a!., 1996b) using the HNHA or 
HMQC-I experiments (Bax et aL, 1994; Kay & Bax, 1990). 
Significant errors in the measurement of aR 2 .lw arise if 
this coupling is ignored. The effects of the free electron 
on the relaxation rates of the amide nitrogens are neg- 
lected due to the small magnetogyric ratio of the 15 N 
nucleus relative to that of *H. 



Calculation of r att 

The magnetic interaction of an unpaired electron 
and a proton is described by the modified Solomon- 
Bloembergen equations (Solonon & Bloembergen. 
1956).The enhancement of the longitudinal (AR^ and 
transverse (AR 2 ) relaxation rates of a proton due to the 
spin label are given by the equations (Kosen, 1989; 
Gellespie & Shortle, 1997a): 



^ 1 = A (^) = ?(rrfe) 



(5) 



(6) 



where K is the constant 1.23 x 10" 32 cm 6 s" 2 for a nitrox- 
ide radical, r is the distance between the electron and 
proton, t c is the correlation time for the fluctuation of the 
electronic-nuclear dipole-dipole interaction, and udh is 
the Larmor frequency of the proton. These equations are 
based on the assumptions that the vector between the 
electron and proton is free to undergo isotropic 
rotational diffusion, and that the distance r is constant. 
The value of x c for each individual amide can be deter- 
mined directly from the ratio of AR 2 to ARj by combin- 
ing equations (1) and (2) to obtain: 



Xr — 
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With a value of t c in hand, the distance between the pro- 
ton and electron, r, is calculated by rearranging equation 

(2): 



^ " AR 2 V c (1 + 



(8) 



Individual values of t c , rather than a global average, 
were utilized to reflect directly the experimentally 
measured relaxation enhancements for each amide. For 
an ensemble of interconverting protein-ligand com- 
plexes, the measured distances will reflect the weighted- 
average of the individual electron-proton distances and 
thus are denoted as r rff . 

Standard deviations for &R lf AR 2 . *c and r cff were cal " 
culated by conventional error propagation (Bevington & 
Robinson, 1992). 



Structure m deling 

The positions of the nitroxide moiety in TEMPOGlc 4 
bound to CBDm! or CBD N2 were estimated using a 
simple restrained minimization protocol provided witji 
X-PLOR r v.3.8 (Brunger, 1992). The only attractive force 
was the- -soft-square potential of the distance restraint* 
energy term (£ NO e) and me oru Y repulsive force was that 
corresponding to van der Waals interactions (£ V dw)- The 
bounds on the effective electron-proton distances, r c£f , 
were defined for three cases. First, for eight amide pro- 
tons in CBDn! and eight in CBD N2 for which AR 2 values 
could not be obtained due to severe paramagnetic line- 
broadening (thus r rff < 11 A) , the distance bounds were 
set to the range of 1-11 A. Second, for 27 amide protons 
in CBD M1 and 26 in CBD^ for which the measured r rff 
were 11 to 15 A, the distances were restrained to 
r rff ± 3 A. Third, for 54 amides in CBD N i and 59 in 
CBD N2 with r cff >15A, the distance bounds were set 
from 13 A to an arbitrarily high value of 45 A. Although 
distances between 11 and 23 A were measured for 
CBD N i and CBD tg2 (Figure 6), it was found that those 
>15 A were consistently underestimated in light of the 
dimensions of these proteins. Therefore, r M values great- 
er than 15 A were used only to derive lower bounds to 
restrain the nitroxide atoms from approaching the corre- 
sponding amide protons in the two protein structures. 

To account for multiple positions of TEMPOGlc 4 
bound to CBDm! or CBD^, ensemble-averaging (Bonvin 
& Brunger, 1996) was applied to coordinates systems 
containing one, two, or three nitroxides, defined as single 
atoms with a mass of 30 amu and a van der Waal radius 
of 25 Ar The co-ordinates of the protein (pdb file 1ULO 
for CBDjj, and a low-energy structure for CBD N2 
(unpublished)) were held fixed, and each ensemble of 
nitroxide groups was minimized 60 times starting from 
random positions. The quality of the calculated struc- 
tures were estimated by the NOE energy penalty due tP 
distance restraint violations. 
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